Mining Strongly Correlated Sub-graph Patterns by Considering Weight and Support Constraints

Authors

  • Gangin Lee
  • Unil Yun
Abstract

Frequent graph mining is one of the most actively studied areas of data mining, and its importance has grown steadily as real-world databases become more complex. Weighted frequent graph mining applies the importance of real-world objects to graph mining, and numerous related studies have been conducted. However, not all results obtained from this approach are actually useful; a significant portion may be meaningless even though they are weighted frequent sub-graph patterns. To overcome this problem, we propose a novel method, called MSCG (Mining Strongly Correlated sub-Graph), which considers whether the elements of a sub-graph pattern are closely correlated with one another. In our experiments, MSCG outperforms a state-of-the-art method with respect to runtime and memory usage.


Similar resources

The Smallest Valid Extension-Based Efficient, Rare Graph Pattern Mining, Considering Length-Decreasing Support Constraints and Symmetry Characteristics of Graphs

Frequent graph mining has been proposed to find interesting patterns (i.e., frequent sub-graphs) from databases composed of graph transaction data, which can effectively express complex and large data in the real world. In addition, various applications for graph mining have been suggested. Traditional graph pattern mining methods use a single minimum support threshold factor in order to check ...


Birds Bring Flues? Mining Frequent and High Weighted Cliques from Birds Migration Networks

Recent advances in satellite tracking technologies can provide a huge amount of data for biologists to understand the continuous, long-range movement patterns of wild bird species. In particular, highly correlated habitat areas are of great biological interest. Biologists can use this information to seek potential ways of controlling highly pathogenic avian influenza. We convert these biological problem...


Application of simulated annealing for optimization of blasting costs due to air overpressure constraints in open-pit mines

Estimating the costs of blasting operations is an important parameter in open-pit mining. Blasting and rock fragmentation depend on two groups of variables. The first group consists of mass properties, which are uncontrollable, and the second one is the drill-and-blast design parameters, which can be controlled and optimized. The design parameters include burden, spacing, hole length, hole diam...


Fouille de graphes sous contraintes linguistiques pour l'exploration de grands textes (Graph Mining Under Linguistic Constraints to Explore Large Texts) [in French]

In this paper, we propose an approach to explore large texts by highlighting coherent sub-parts. The exploration method relies on a graph representation of the text according to the Hoey linguistic model which allows the selection and the binding of sentences in the graph. Our contribution relates to using graph mining techniques ...


Application of Gap-Constraints Given Sequential Frequent Pattern Mining for Protein Function Prediction

OBJECTIVES: Predicting protein function from the protein-protein interaction network is challenging due to the complexity and huge scale of the protein interaction process, along with its inconsistent patterns. Previously proposed methods such as neighbor counting, network analysis, and graph pattern mining have predicted functions by calculating the rules and probability of patterns inside the network. Althou...




Publication date: 2013